Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 1022420200120040081
Phonetics and Speech Sciences
2020 Volume.12 No. 4 p.81 ~ p.90
Building a Korean conversational speech database in the emergency medical domain
Kim Sun-Hee

Lee Joo-Young
Choi Seo-Gyeong
Ji Seung-Hun
Kang Jee-Min
Kim Jong-In
Kim Do-Hee
Kim Bo-Ryong
Cho Eun-Gi
Kim Ho-Jeong
Jang Jeong-Min
Kim Jun-Hyung
Ku Bon-Hyeok
Park ?Hyung-Min
Chung Min-Hwa
Abstract
This paper describes a method of building Korean conversational speech data in the emergency medical domain and proposes an annotation method for the collected data in order to improve speech recognition performance. To suggest future research directions, baseline speech recognition experiments were conducted by using partial data that were collected and annotated.
All voices were recorded at 16-bit resolution at 16 kHz sampling rate. A total of 166 conversations were collected, amounting to 8 hours and 35 minutes. Various information was manually transcribed such as orthography, pronunciation, dialect, noise, and medical information using Praat. Baseline speech recognition experiments were used to depict problems related to speech recognition in the emergency medical domain. The Korean conversational speech data presented in this paper are first-stage data in the emergency medical domain and are expected to be used as training data for developing conversational systems for emergency medical applications.
KEYWORD
conversational speech, speech data, speech recognition, annotation, emergency medical domain
FullTexts / Linksout information
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI)